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A resurgence of interest in multiple hypothesis testing has oc- 
curred in the last decade. Motivated by studies in genomics, microar- 
rays, DNA sequencing, drug screening, clinical trials, bioassays, edu- 
cation and psychology, statisticians have been devoting considerable 
research energy in an effort to properly analyze multiple endpoint 
data. In response to new applications, new criteria and new method- 
ology, many ad hoc procedures have emerged. The classical require- 
ment has been to use procedures which control the strong familywise 
error rate (FWE) at some predetermined level a. That is, the prob- 
ability of any false rejection of a true null hypothesis should be less 
than or equal to a. Finding desirable and powerful multiple test pro- 
cedures is difficult under this requirement. 

One of the more recent ideas is concerned with controlling the false 
discovery rate (FDR), that is, the expected proportion of rejected 
hypotheses which are, in fact, true. Many multiple test procedures 
do control the FDR. 

A much earlier approach to multiple testing was formulated by 
Lehmann [Ann. Math. Statist. 23 (1952) 541-552 and 28 (1957) 1- 
25]. Lehmann's approach is decision theoretic and he treats the mul- 
tiple endpoints problem as a 2 k finite action problem when there are 
k endpoints. This approach is appealing since unlike the FWE and 
FDR criteria, the finite action approach pays attention to false accep- 
tances as well as false rejections. In this paper we view the multiple 
endpoints problem as a 2 k finite action problem. We study the pop- 
ular procedures single-step, step-down and step-up from the point of 
view of admissibility, Bayes and limit of Bayes properties. For our 
model, which is a prototypical one, and our loss function, we are 
able to demonstrate the following results under some fairly general 
conditions to be specified: 
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(i) The single-step procedure is admissible. 

(ii) A sequence of prior distributions is given for which the step- 
down procedure is a limit of a sequence of Bayes procedures. 

(iii) For a vector risk function, where each component is the risk 
for an individual testing problem, various admissibility and inadmis- 
sibility results are obtained. 

In a companion paper [Cohen and Sackrowitz, Ann. Statist. 33 
(2005) 145-158], we are able to give a characterization of Bayes pro- 
cedures and their limits. The characterization yields a complete class 
and the additional useful result that the step-up procedure is inad- 
missible. The inadmissibility of step-up is demonstrated there for a 
more stringent loss function. Additional decision theoretic type re- 
sults are also obtained in this paper. 

1. Introduction. A resurgence of interest in multiple hypothesis testing 
has occurred in the last decade. Motivated by studies in genomics, microar- 
rays, DNA sequencing, drug screening, clinical trials, bioassays, education 
and psychology, statisticians have been devoting considerable research en- 
ergy in an effort to properly analyze multiple endpoint data. In response to 
new applications, new criteria and new methodology, many ad hoc proce- 
dures have emerged. The classical requirement has been to use procedures 
which control the strong familywise error rate (FWE) at some predeter- 
mined level a. That is, the probability of any false rejection of a true null 
hypothesis should be less than or equal to a. Finding desirable and powerful 
multiple test procedures is difficult under this requirement. Two useful tools 
for the construction of such multiple level-a tests are the closure principle 
[see Marcus, Peritz and Gabriel (1976), as well as Hochberg and Tamhane 
(1987)] and the partitioning principle [see Stefansson, Kim and Hsu (1988) 
and Finner and Strassburger (2002)]. These tools can be used to generate 
large classes of multiple test procedures satisfying the FWE criterion. 

One of the more recent ideas is concerned with controlling the false dis- 
covery rate (FDR), that is, the expected proportion of rejected hypotheses 
which are, in fact, true. Many multiple test procedures do control the FDR. 
See, for example, Benjamini and Hochberg (1995), Benjamini and Yekutieli 
(2001), Efron, Tibshirani, Storey and lusher (2001) and Sarkar (2002). This 
criterion is particularly appealing if the number of endpoints is large. In some 
modern applications this number can be in the thousands. A summary of 
studies on multiple endpoint methods used with microarray data is given in 
Dudoit, Shaffer and Boldrick (DSB) (2003). 

The closure and partitioning principles tend to be linked to the step-down 
approach described in Hochberg and Tamhane (1987) and studied exten- 
sively in the literature. FDR was initially linked to the step-up approach. 
See Hochberg (1988). More recently, step-down and combined step-down 
with step-up methods have been viewed from an FDR point of view. Sarkar 
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(2002) notes: "While the FDR has been receiving increasing attention by 
researchers in different fields of statistics, theoretical progress has not been 
made at a similar pace." Sarkar's remark applies to the entire area of mul- 
tiple endpoint testing. Finner and Strassburger (2002) say: "Further and in 
general difficult problems are the comparison of different multiple test pro- 
cedures and the related questions concerning admissibility." They go on to 
say: "A serious issue is optimality and admissibility of multiple decision pro- 
cedures." DSB (2003) remark "Optimality of multiple tests is an interesting 
research avenue to pursue from both a theoretical and a practical point of 
view." 

A much earlier approach to multiple testing was formulated by Lehmann 
(1952, 1957). Lehmann's approach is decision theoretic and he treats the 
multiple endpoints problem as a 2 k finite action problem when there are k 
endpoints. The formulation as a 2 k action problem is particularly appealing 
since in terms of what is desired, one wishes to decide whether to accept or 
reject for each of the k hypotheses posed. This approach entails the specifi- 
cation of losses, which can be quite general. Lehmann (1952, 1957) was able 
to demonstrate some optimality properties for the single-step procedure and 
step-down procedure in two-dimensional problems for some hypotheses and 
for some restricted classes of procedures. Methods developed through the 
years to further the theory of testing a single hypothesis (a two-action prob- 
lem) do not extend easily to multiple actions and little progress has been 
made for this model. Nevertheless, the potential and importance of this ap- 
proach are compelling since the evaluation of methodologies and procedures 
is wanting and necessary in this subject. Little is known about properties 
of the various procedures and rigorously studying the underpinnings of the 
methodologies is essential. Furthermore, unlike the FWE and FDR criteria, 
the finite action approach pays attention to false acceptances as well as false 
rejections. 

Our approach will be to regard the problem as a 2 k action problem. 
We carefully distinguish between what is known as the global problem and 
multiple endpoints problem. We are very precise about what null hypotheses 
and what alternative hypotheses are to be considered. Several notions of 
monotonicity of procedures and monotonicity of risk functions have been 
introduced and studied in Cohen and Sackrowitz (CS) (2004). In this paper 
Bayes procedures, limits of Bayes procedures and admissibility results of 
procedures are studied. In particular we examine single-step, step-down and 
step-up procedures. We note that DSB (2003) classify the 18 procedures 
they study as single step, step-down or step-up. We consider loss functions 
that are sums of losses for the individual endpoints. 

We confine our study to a simple but prototypical model, although many 
of the results would remain true for other models. The model assumed is 
that we observe a(fcxl) random vector Z which is assumed to be /c-variate 
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normal with mean vector /x and known covariance S. Among the results are 
the following: 

Result 1.1. Suppose the covariance matrix £ is of the intraclass type, 
that is, all variances equal, all covariances equal. Then under some mild 
conditions the single-step procedure is admissible. The approach used to 
prove admissibility is somewhat new. 

Result 1.2. If £ = a 2 I, the step-down procedure studied is shown to 
be a limit of a sequence of Bayes procedures. 

Result 1.3. Suppose £ is intraclass, and p is the common correlation 
coefficient between any pair of variables. Consider a vector risk (VRI) where 
the components of the vector are the risks for the individual testing prob- 
lems. Then the single-step procedure is admissible for any — 1 < p < 1. The 
step-down and step- up procedures are admissible if and only if p > 0. As a 
corollary it follows that for p < 0, step- up and step-down are inadmissible 
for the loss function which sums the losses for the individual component 
problems. 

In Section 2 we state the models, distinguish between global test prob- 
lems and multiple endpoint testing problems, introduce the loss functions, 
describe the various properties of procedures and give other preliminaries. 
In Section 3 we describe the single-step, step-down and step-up procedures. 
In Sections 4 and 5 we state properties of these procedures. All proofs are 
given in the Appendix. 

2. Models and preliminaries. 

2.1. Models. Let Z be a (k x 1) random vector which is A;-variate nor- 
mal with mean vector /j, and known covariance £. One global one-sided 
hypothesis testing problem is 

(2.1) # (G) :/x = vs K (G) :/i>0\{/x = 0}, 

that is, pi > 0, i = l,2,..., k, with at least one pi > 0. Such a problem is 
distinguished from a one-sided multiple endpoints problem in which one 
tests 

(2.2) Hi-.pi = vs Ki-.pi>0, i = l,2,...,k. 

That is, the latter problem is a 2 k action problem where one selects an action 
to either accept or reject Hi, i = 1, 2, . . . , k. 

Note that another form of the one-sided multiple endpoints problem is 



(2.3) 



H* : pn < vs Ki : p, t > 0. 



MULTIPLE COMPARISON PROCEDURES 



5 



In the multiple endpoints literature there are ample instances of both the 
Hi and H* problems. We mention both since from a decision theory point of 
view we will see that sometimes different results ensue depending on whether 
Hi or H* is being tested. In connection with distinguishing between Hi 
and H* we mention two practical situations where the multiple endpoints 
scenario arises. 

(I) Consider the problem of comparing k treatments with a control as- 
suming (often realistically) that the treatment mean will be at least as large 
as the control mean. [This model is called the tree order model in Robertson, 
Wright and Dykstra (1988).] Then if Zi represents the difference between a 
reading on the ith treatment and the control, Zi has mean //j where /ij > 0. 
Assuming all treatment and control observations are normal, independent, 
with variances 1, then Z is multivariate normal with mean vector /x and 
covariance matrix S. The covariance matrix S is (k x k) and 



(2.4) 



1/2 
1 



1/2 \ 
1/2 



\l/2 1 / 

The appropriate multiple hypotheses in this case are those in (2.2). 

Note that the (k x k) covariance matrix £ in (2.4) is a special case of a 
class of covariance matrices which are called intraclass. That is, a covariance 
matrix S = (cry) is intraclass if an are the same for all i = 1, 2, . . . , k, and aij, 
i = 1, . . . ,k,j = 1, . . . , k, j, are the same. When E is intraclass, then the 
normal variables are exchangeable. Note also that a special case of intraclass 
is when all an are the same and all aij = 0, i ^ j. In this latter case the Zi are 
independent. Another special case of intraclass is when k = 2, and an = ai2- 
The intraclass matrix £ may be written as 



(2.5) 



a 



(I P 
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\p ■■ 



P 
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with p restricted to the interval —l/(k — 1) < p < 1. See, for example, Kr- 
ishnaiah and Pathak (1967). 

(II) Let Xj, i = 1,2, be independent normal with mean vector Vi and 
known covariance Sj. Xi corresponds to a (k x 1) vector of measurements 
made on a control subject. X2 corresponds to a (k x 1) vector of mea- 
surements made on a treatment subject. Consider Z = X2 — Xi and note 
Z is multivariate normal with mean vector fi = Vi — V\ and covariance 
matrix £ = Si + T,2- If one feels that the treatment cannot decrease vu, 
i = 1, 2, . . . , k, then this is the classic multiple endpoints problem with (2.2) 
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as the multiple hypotheses. If one feels that the treatment can reduce vn as 
well as increase uu, then this is the classic multiple endpoints problem with 
(2.3) as the hypotheses. 

2.2. Preliminaries. A 2 k finite action problem has actions a = (ai, 02, • • • , 
where ai equals or 1 for i = 1, . . . , k. An action where aj = 1 means that Hi 
is rejected, where if ai = 0, Hi is accepted. Thus, for example, a = (1, . . . , 1)' 
means all Hi are rejected. It will be convenient to define 

T = {u : u = (u\, . . . , Ufe)', Ui = or 1, all i}. 

Note that T can be used to represent the totality of all actions. However, V 
will serve other purposes as well. 

Decision rules <5(-|z) are probability mass functions on T with the inter- 
pretation that 5(a|z) is the conditional probability of action a given z is 
observed. For each z, a nonrandomized decision rule chooses a single el- 
ement of r with probability 1 and assigns all other actions probability 0. 
Each decision rule 5 determines a set of test functions for the individual test- 
ing problems. These test functions are given by ip(z) = (V>i(z), . . . ,ipk(z))' 
where ipi( z ) IS the probability of rejecting Hi. A decision procedure <5(ajz) 
determines a set of if)f(z), i = 1, . . . , k, as follows: 

(2.6) <5(a|z) = ^a i 5(a|z), 

agAi aer 

where Ai = {a £ T : a has a 1 in the ith position}. Whereas <5(a|z) deter- mines 
the reverse is not true. If iZ'(z) is nonrandomized it uniquely determines some 
<5(a|z). The <5(a|z) determined is nonrandomized. 

For problem (2.2), we partition the parameter space O = {/i. : fii > 0, i = 
1, . . . , k} into 2 k sets il V5 v € T, where Q v = {/x : /x = (^1,^2, ■ ■ ■ , Hk)', and 
lii > if Vi = 1 and ^ = if Vi = 0, i = 1, . . . , k}. For problem (2.3), Q = R k 
and we have a similar partition but Vi = means /ij < 0. Also for problem 
(2.2) let fiW = {/i:/xeO,/i ! = 0}. 

A loss function is a function of the action taken and the true state of 
nature. We will take the loss function to be additive over the individual 
component problems and for each component problem we choose the loss as 
follows: zero loss for a correct decision; a loss of 1 for rejecting Hi when it 
is true and a loss of b for accepting Hi when it is false. The loss function for 
the finite action problem can be expressed as 

k k 

(2.7) L(a,fi) = ^2ai(l-Vi)+^2b(l-ai)vi, /i£fi v , 

i=l i=l 

with < b. This loss function reflects the property that the loss is additive 
over the losses for the component problems. 
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The risk function for a decision procedure 5 is 

(2.8) R(d,^)=E ll J2L(a,^)d(a\z). 

agr 

For the above loss function (2.7) it follows from (2.6) that the risk depends 
on 5 only through ip, so we can write (2.8) as 

k 

(2-9) fl(^, A 4) = X;i2( i )(^,M) ) 

i=l 

where 

(2.10) R(i)(^V) = {t , f i{ ^' , t ^ ^ = n 

The risk function (2.9) can be written as 

(2.11) J E /x (^'(l-v) + 6(l-^) / v), 

where 1 = (1, . . . , 1)'. 

A decision procedure ip is said to be inadmissible if there exists another 
procedure i/>* such that R(tp*,fi) < R(i/),/j,) for every /x with strict inequal- 
ity for some fi. Otherwise x]j is admissible. 

As previously noted we can view the multiple endpoints problem as one 
involving k endpoints in which ipi(z), i = 1, . . . , k, is a test function for the 
ith endpoint. In this scenario one may wish to consider a vector risk ap- 
proach where the risk consists of a (k x 1) vector R = (Rn\, . . . ,Rf^\)' , 
-R(j) = R^ij/j,) given in (2.10). In this formulation any procedure which 
has an admissible test for each single component is admissible in the vector 
risk formulation. For general results concerning vector risks, see Cohen and 
Sackrowitz (1984). 

3. Some procedures for multiple endpoint problems. We focus on three 
special cases of the most frequently discussed procedures, namely single-step, 
step-down and step-up. The three procedures are considered in Hochberg 
and Tamhane (1987) and Shaffer (1995). In all that follows we assume with- 
out loss of generality that the variance of each Zi is 1. Furthermore, for 
now for step-up and step-down we limit our discussion to procedures which 
are symmetric in the variables Z\, . . . , Zk, that is, procedures that are per- 
mutation equivariant. The normal model, with intraclass covariance matrix, 
represents the most general case of permutation invariance. 

3.1. Single-step. The single-step procedure we study is: 

Procedure 3.1. Reject Hi if and only if Zi > C{. 

The constants Cj are typically chosen so that the strong familywise error 
rate (FWE) is less than or equal to a. 
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3.2. Step-down. The step-down procedure we study is as follows: 

Procedure 3.2. Let < Zi 2 ) <■■■< Z^ be the order statistics for 
the set {Z\,Z 2 , . . . ,Z k )' and let Cj be a strictly increasing set of critical 
values: 

(i) If Zcja > C k , reject Hha. Otherwise accept all Hi. 

(ii) If H/ k \ is rejected, reject Hr k _i\ if Zn._y\ > C k ^\. Otherwise accept 
all . . . ,H(i)- 

(hi) In general, at stage j, if Zq\ > Cj, reject H^y Otherwise accept 
H {j) ,...,H {1 y 

The critical values may be chosen so that: 

Outcome 3.3. P{Z( k ) < C k } = 1 - a when all Hi are true; P{Z/ k _ 1 \ < 
Ck-i} = I — a, with excluded, and H^y, • • • ,H( k _y are true. That is, 
after one of the hypotheses is rejected, we consider a new problem with the 
(k — 1) remaining variables that correspond to those parameters not rejected 
at step 1. P{Z^ < Cj} = 1 — a, when (k — j) variables and their corre- 
sponding hypotheses are excluded and H^y , . . . , H(j\ are true. This choice of 
constants leads to control of the strong FWE. 

The step-down procedure results by applying the closure method. See 
Hochberg and Tamhane [(1987), Chapter 2, Section 4.1] for a description 
of this method. The method is used by Marcus, Peritz and Gabriel (1976). 
Finner and Roters (2002) note that this method strongly controls the FWE 
and they call such a testing method a multiple level-a test procedure. The 
step-down method rejects more H^s than the single-step procedure for the 
same a. The single-step procedure would use C k for Cj, all j = 1, . . . , k. 

Remark. Procedure 3.2 is one type of step-down procedure. Another 
type, used, for example, by Marcus, Peritz and Gabriel (1976), uses a like- 
lihood ratio test in applying the closure method. This results in a different 
step-down procedure. 

3.3. Step-up. The step-up procedure we study is as follows: 

Procedure 3.4. Let Zm < Zi 2 ) <■••< Zha be the order statistics for 
the set (Z\, . . . , Z k ) and let Cj be a strictly increasing set of critical values. 

(i) If Z(x) < C\, accept -ff(i). Otherwise reject all Hi. 

(ii) If H^y is accepted, accept H( 2 ) if Z( 2 ) ^ C 2 . Otherwise reject _ff( 2 ) , • • • , 

(iii) In general, at stage j, if Zu\ < Cj, accept H^y Otherwise reject 

H {j y...,H {k y 
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The critical values Cj are sometimes chosen so that: 

Outcome 3.5. P{Z {1) < Ci,Z (2) < C 2 ,...,Z {j) < Cj} = 1 - a (1 < j < 
k) when all p,i = 0, i = 1, . . . , k. 

This choice of constants enables control of the strong FWE. The step-up 
procedure is credited to Hochberg (1988). 

4. Properties of single-step. Recall b > and for £ intraclass, — l/(k — 
l)<p. 

Theorem 4.1. For problems (2.2) and (2.3), suppose T, is intraclass. 
Suppose the loss function is (2.7). Then the single-step procedure is admis- 
sible if p> —1/6. 

For the proof see Appendix A.l. 

Remark 4.2. The proof that the single-step procedure is admissible 
under the given conditions is accomplished by demonstrating it is uniquely 
locally admissible in some sense. In a one-dimensional, one-sided hypothesis 
testing problem Lehmann (1986) describes a unique locally best test as one 
whose derivative of the power function evaluated at the null point is largest 
among all size a tests. For the multivariate global testing problem (2.1) 
a test is uniquely locally best in a direction if it has a similar property 
as in the one-dimensional case. Marden (1982) utilizes the notion of local 
admissibility for global testing problems. For our finite action problem we 
introduce a notion of unique local admissibility and demonstrate that the 
single-step procedure has this property. As in the global testing problem the 
focus is on the point /z = and is linked to the derivative of a function of 
the risk evaluated at /i = 0. 

Note that the theorem applies to the tree order model since in that case 
p> 0. 

The admissibility result is particularly interesting in light of a result in 
CS (2004). There it is stated that when p> 0, no nontrivial Bayes test can 
be type-I monotone for problem (2.2). See CS (2004) for the definition of 
type-I monotone. The single-step procedure is type-I monotone, so a first 
guess might be that it is inadmissible. The result indicates that the first 
guess is incorrect. 

Next we have: 

Theorem 4.3. For problems (2.2) and (2.3), suppose T, = I (indepen- 
dence case). Suppose the loss function is (2.7). Then the single-step proce- 
dure is proper Bayes. 
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For the proof see Appendix A.l. 

This result should also be contrasted with the result in CS (2004) which 
states that the single-step procedure cannot be Bayes if £ is intraclass with 
p>0: 

Our final result of this section is: 

Theorem 4.4. For problems (2.2) and (2.3), suppose £ is intraclass. 
Suppose the risk function for each component problem is (2.10). Then the 
single-step procedure is admissible for the vector risk VRI described in Re- 
sult 1.3. 

For the proof see Appendix A.l. 

Although the single-step procedure has the above desirable properties, 
many feel that single-step procedures are too conservative. That is, they 
do not detect significant effects often enough while controlling the FWE. 
Single-step procedures are somewhat akin to some simultaneous confidence 
bound procedures which are highly conservative, making it difficult to de- 
clare significance for an individual endpoint. 

5. Properties of step-down and step-up. In this section the loss function 
is (2.7). 

We establish the following theorems: 

Theorem 5.1. For problems (2.2) and (2.3) for £ = I, there exists a 
sequence of prior distributions for which the step-down procedure is a limit 
of a sequence of Bayes procedures. 

Remark 5.2. In Theorem 4.1 it is shown that the single-step procedure 
has a limiting "local" optimality property. The limit point is 0. In Theorem 
5.1, however, it is shown that the step-down procedure has a limiting op- 
timality property, but now the limiting parameter points receiving weight 
tend to oo. 

Theorem 5.3. For problem (2.3), £ = I, k = 2, b = l, the step-down 
procedure is admissible. 

Theorem 5.4. For problems (2.2) and (2.3) suppose £ is intraclass. 
Then the step-down procedure is admissible for vector risk VRI if and only 
ifp>0. 

For the proof see Appendix A. 2. 

Theorems 5.3 and 5.4 and the proofs of these theorems also apply to the 
step-up procedure given in Procedure 3.4. The most interesting properties 
for step- up are given in the companion paper CS (2005). 
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APPENDIX 

A.l. Proofs of Theorems 4.1, 4.3 and 4.4. In order to prove Theorem 
4.1, we need a definition and theorem. First the definition. 

For each v 6 T, let Cl v be the closure of Q v . Let R v (ip, fi), for /i£fi v , be 
the continuous extension of R{ip, /x) for fi £ £l v . Note that the point £ fj v 
for all v 6 T. Since the risk function is continuous on each £l v , it follows 
that if if>* is better than then R v (ip*,n) < i? v (i/>,/^) for all /x £ f2 v - in 
particular, ip* better than tp implies i? v ('0*,O) < R v (rp,0) for all vef. 

The next theorem is useful when comparing decision procedures under the 
assumptions of this paper. That is, assume normality and assume the risk 
function is (2.11). In the case k = 1 (which is the usual one-sided hypothesis 
testing problem), the theorem reduces to the well-known result that if tp* is 
better than ■»/>, then their risks at zero (size of the test) must match. 

Theorem A.l. If if)* is better than ip, then R v (ip*,0) =R v (ip,0), all 

ver. 

Proof. The assumption that -0* is better than i/> implies that R v (il>* , 0) < 
Rv(i/>,0) for all v e T. Suppose R Q (ip*,0) < Ro(ip,0). Since R (ip,0) = 
E J2i=iM z ) and #iC0,O) = k & -£oEi=i^i(z), it follows that Ri(^,0) < 
i?i (?/>*, 0). This is a contradiction. A similar contradiction is reached if it is 
assumed that Ri(tp*,0) < Ri(ip,0). 

Now suppose for some v £ r r , r = 1, . . . , k — 1, R v (tp* , 0) < R v (tft, 0), 
where r r = {v G V : Y$=i Vi = r}. Then from (2.11) 

(A.l) RvW,0)= E W(l-v) + &(l-^)'v]. 

ver r ver r 

Now recognize that £ V Gr s v = (»Zi)l an d collect terms so that (A.l) 
equals 

k 

If iCr 1 ) ~ Kill)} > 0, then £o£„ err #v (</>*, 0) < £ E ve r r #v(V>,0) im- 
plies E Y$ =1 ipi(z) < E J2i=iM z )- Tms in tum implies that Ri(ip,0) < 
i?i(V>*,0). This is a contradiction. If [( fc ~ 1 ) - < or equals 0, a sim- 

ilar contradiction is reached. Thus the theorem is proved. □ 

To prove Theorem 4.1 we need to study the behavior of linear combina- 
tions of the i? v functions. When S is assumed to be intraclass we may write 
S = cr 2 ((l - p)I + pll'). In this case 

s-M^a-pr'u-Gii)', 



(A.2) 



bk 



k-l 



+ 



k-l 

V 



k-l 
k — r 
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where 

G = P /{l + {k-l)p). 
As earlier we take a 2 = 1 without loss of generality. 

Proof of Theorem 4.1. Let if)* be the single-step procedure. Suppose 
if) is better than if}*. Then Theorem A.l implies if) cannot be uniformly 
better than the single-step procedure at 0; that is, there does not exist a if) 
such that R v (if>,0) < R v (ip*,0) for all v, with strict inequality for some v. 

Therefore we need only consider procedures if) such that 

(A.3) R v (if),0) = R v (if>*,0) forallver. 

For if) satisfying (A.3) we study ^vRv(ip, A*), where A v are coefficients that 
can depend on /x, for v € T and where R v (if), fj,) is evaluated at fx = Av, A > 
0. In this case write X^ver •V-RvC'/'j A 4 ) = Svgr (A) i? v Av). Among if) 
satisfying (A.3), we show that if)* is the unique procedure that minimizes 
the derivative with respect to A of J] ver A v (A)i? v (?/>, Av), evaluated at 0. 
This demonstrates the admissibility of if)*. 

Now we consider ^ vgr A v (A)i? v (?/>, Av), which using (2.11) is 

(A.4) / "... /IE E A v (A)[l / (^ + 6v)-(l + 6)V'v]/(z|vA)Lz, 
J J U=over s J 

where /(z|vA) is obtainable from 

f(z\fl) = (l/(2^) fc /2j S |l/2 )e -(l/2)(z^)'E-l(z- M)> 

For a chosen set of A V (A) we seek a if), among the class of procedures 
satisfying (A.3) that minimizes the derivative of (A.4) with respect to A, 
evaluated at A = 0. 

Recall with a 2 = 1, 5T 1 = (1 - p)" 1 ^ - Gil'). 

Now we choose A v , v G T, as follows: 

Let C = (Ci, . . . , Cfc)', where Cj is given in Procedure 3.1. 

Let Si = (0, . . . , 1, 0, . . . , 0)', that is, a vector with all zeros except 1 in the 
ith position. Let 7 = [(1 - Gk) + (1 + b)G]/b(l - Gk) = (1 + bp)/b(l - p), 
and note that 7 > if and only if (1 + bp) > 0. Let A (A) = 1, Ai(A) = 
T e- c ' rllA , A V (A) = e- c ' s_lvA , for v = e h i = 1, . . . , k, and A V (A) = 
otherwise. 
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The derivative of (A. 4) with respect to A evaluated at A = is expressed 

as 



= (1/(1 -P))J-J jEW + bet) - (1 + Wet] 
l_1 x^(/-Gll')(z-C) 
+ jb{k - ^'l)k(l - Gk){z - C) l/(z|0) dz. 



We will choose i/>(z) to minimize the integrand in (A.5) for each z. Toward 
this end we evaluate the bracketed term on the right-hand side of (A.5), 
which becomes 



tl>'l[k(z - C) - k 2 G(z - C)} + b[k(z - C) - k 2 G{z - C)] 
- (1 + b)if>'(z - C) + (1 + b)kG^'l{z - C) 
+ jb(k - if>'l)k(l - Gk){z - C) 
= -(l + 6)^'(z-C) 

+ if>'lk(z - C){(1 - Gk) + (1 + b)G - 7 6(1 - Gk)} 
+ bk(z - C)[(l — G) + 7fc(l - Gk)]. 



At this point we recognize that by substituting the selected value of 7 in 
the bracketed term on the right-hand side of (A. 6), the term becomes 0. 
Hence to minimize (A. 6) we choose ^(z) = 1 if Z\ > Ci and choose ipi(z) = 
if Zi < Ci. This is the single-step procedure. Thus the single-step procedure 
is admissible for problem (2.2) [and for problem (2.3), since the same proof 
applies] if 7 > 0. But 7 > if (1 + bp) > which amounts to the given part 
of the theorem. □ 

In order to prove Theorem 4.3 we will need the following definition and 
theorem. 

A decision procedure if}* is Bayes with respect to a prior distribution £(/x) 



In connection with Bayes procedures, let q(uj\z) denote the posterior prob- 
ability of the subset u € f2, given z. Then the following theorem describes a 
Bayes procedure. 



(A.5) 




if 



E(R(!lf*,p)=wfE(R(il>,n). 
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Theorem A. 2. Consider the risk function in (2.9). The Bayes proce- 
dure is ip* = (ipl, . . . ,ipf,)' , where 



Proof. Since the loss function is additive, the sum of expected risks for 
the individual components is minimized by minimizing the expected risk 
for the individual components. The theorem follows by the same argument 
used for a single testing problem. See, for example, Mood, Graybill and Boes 
[(1974), page 417]. □ 

Proof of Theorem 4.3. Choose a prior distribution such that fix, . . . , Hk 
are independent. Then q(£l^\z) depends only on Zj. Furthermore, q(£l^\z) 
is a decreasing function of Z{. Use Theorem A. 2 and the fact that the prior 
can be chosen so that q(£l^\z) < b/(b+ 1) is equivalent to Zi > C%. □ 

Proof of Theorem 4.4. We need only show that the procedure is 
componentwise admissible. That is, we need only prove that the test for 
each Hi : Hi = vs Ki : fii > is admissible. It suffices to show that Z\ > C\ 
is an admissible test for Hi : \x\ = vs K\ : hi > 0. To prove this we note 
that the multivariate normal density is proportional to 



Letting y = S -1 z, (A. 7) can be written in exponential family form as 



A result of Matthes and Truax (1967) implies that any test of H\ : m = 
vs K\ : hi > which is monotone in yi for fixed (2/2, • • • ,Vk) is admissible. 
Here monotone means if y[ < y'{ and the test rejects for y[, then it must 
also reject for y'[ when y2, ■ ■ ■ ,Vk are fixed. 

Now note that the single-step procedure is of the form reject if z\ > C\. 
Since z = Sy, this can be expressed as 





otherwise. 



(A.7) 



g-Cl/^z'E-izg-a^M'E-VeM' 



/i(y)/3(M)e M ' y = / l (y)/?(/x)e J/im+ St 2 ^. 



k 



(A.8) 



yi + pJ2yj >Ci - 

3=2 



From (A.8) we see that the test for Hi is monotone in yi for fixed (3/2, ■ ■ ■ , Uk)- 

□ 
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A.2. Proofs of Theorems 5.1, 5.3 and 5.4. 

Proof of Theorem 5.1. A sequence of prior distributions will be put 
on various points of O v - The amount of prior probability on each point 
will be expressed as a ratio where the denominator is always expressed as 
D and D is the sum of numerator terms. The sequence of priors is as fol- 
lows: On O( 0) . ..,o) the numerator of the prior probability is 1. On fino,...^) 
the numerator, e fJ, ' fJ '/ 2 e~ Cknk , is put on fi\ = n k ; all other /i's are zero. On 
^(o,o,...,o,i,o,...,o)i where 1 is in the ith position, the numerator e fJ, ' fJ '^ 2 e~ Ckn 
is put on fit = n k ; all other /z's are zero. On 0(o ! ...,o,i,o,...,o,i,o,...,o)j where 1 is 
in the ith and jth positions, the numerator (l/2)e fl W 2 e -C fc n fe -c fe _in fc 1 j g 
put on the points /ij = n k , fij = n k ~ l and ^ = n k ~ 1 , fj,j = n k (all other /x's 
zero). On Om... ) o,i,o,...,o,i,o,...,o,i,o,...,o)) where 1 is in the ith, jth and ith po- 
sitions, the numerator (1 /3!) e MV/2 e -C fe n fc -c , fe_„in fe 1 -c k - 2 n k 2 - 1S p U ^ on s j x 
points, namely, (j* = n k ,^ = n k ~\^ = n k ~ 2 ), (j* = = n k ~\^ = 

n fc_1 ), (m = n k - 1 ,[j,j = n k ,ne = n k - 2 ), (m = n fc_1 , fij = n k ~ 2 , m = n k ), (/x; = 
n k ~ 2 ,fij = n k ,^e = n fc_1 ), (/Xj = n k ~ 2 ,fij = n fc_1 ,^ = n k ) (all other /i's are 

zero). In general, if v G r s , then the numerator (l/s!)e /x ' /J, / 2 eS»=i c Wi-i n( + 
is put on s! points where the /z's are zero except for (fJ,j 1 , ■ ■ ■ ,f^j s ) and all 
permutations where fj,j 1 , . . . , Hj s correspond to Vj x , . . . , Vj a which are 1. 

Next we indicate the numerators of posterior probabilities for each £l v . All 
posterior probabilities have the same denominator. We will note that for each 
fixed z one of the posterior probabilities will tend to 1. This fact means that 
the posterior risk will be minimized by choosing the action that corresponds 
to the O v whose posterior probability tends to 1. We will see that such a 
choice will correspond to the step-down procedure. Here are the numerators 
of the posterior probabilities denoted by ^(O v | z )- All denominators of the 
posterior probabilities are the same and the denominator is the sum of the 
numerators: 

£(tyo,...,o)l z ) = !> 
m(i,o,...,o)\z) = e {zi - Ck)n \ 
?(^(o,...,o,i,o,...,o)l z ) = ^" Cfc)n ' 
C(n ( o,...,o,i,o,..,o,i,o,..,o)l») = (l/2)[e^- c ^ k+ ^- c "-^ h - 1 

_i_ e (zj-C k )n k +(z i -C k - 1 )n k - 

For an arbitrary v € T s , 

(A.9) £(O v |z) = (1/s!) J2 exp(^(^-GVi-*)n (fe+1) - 

all permutations V i=l 
of v Jl ,...,v js 
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where the indices jg reflect a permutation of Vj 1 , . . . , vj s . 

At this point fix z. Say, for example, and without loss of generality, Z{ > 
Cfc + i„j, i = 1, 2, . . . ,r, and Z{ < Cfc + i_j, i = r + 1, . . . , k. Then if r = 0, the 
posterior probability of ^(o,...,o) denoted by (7(^(0,. ,.,o)l z ) tends to 1 as n — > 
00. If r > 1, then (7(^(1,. .. ,1,0,. ..,0) l z )i with r ones in (1, . . . , 1, 0, . . . , 0), tends 
to 1 as n — > 00. This is true since £(f2 v | z ) tends to 00 (except for O( 0i ...,o)) 
as n — > 00, but the ratio of C(^(i,...,i,o,...,o)l z )/^(^v|z) where v differs from 
(1, . . . , 1, 0, . . . , 0) tends to 00 as n — ► 00. Thus we have demonstrated that 
the step-down procedure is a limit of a sequence of Bayes procedures. □ 

PROOF of Theorem 5.3. For problem (2.3) the risk is taken from 
(2.10) and (2.11) except now Ri(^i,fj.) = ^(^(z)) when [i% < 0. Let the 
step-down procedure be denoted by ?/> SD . Note that the risk function for an 
arbitrary procedure ip is as follows: 

For fii > 0, [12 > 0, 

(A.10) fl(^,A 4 )=2-E |4 (^i(z)+^ 2 (z)). 
For hi < 0, H2 < 0, 

(A.ll) RW,p) = Er(fc(*) + ih(z)). 

For \i\ > 0, H2 < 0, 

(A.12) R(iP,n) = l-E^ 1 (z) + E^ 2 ( Z ). 
For /xi < 0, H2 > 0, 

(A.13) i?(V,/x) = l-^2(z) + ^i(z). 

Now if -0 SD is inadmissible from (A.10)-(A.13), then there exists a -0* which 
is better, that is, 

(A. 14) E ( ,(ri(z) + r2(z))>E^f D (z) + ijl D (z)), Ml >0,/x 2 >0, 

(A. 15) ^P(^) + V 2 SD W)>^W( Z ) + V'2( Z )), Mi<0,M2<0, 

(A.16) ^(^(^-^(z))^^^^)-^*^)), W >0,/z 2 <0, 

(A.17) ^(^(^-^(z))^^^^^-^^)), /xi<0,/x 2 >0, 

with at least one strict inequality for some /z. By letting either fii — > or 
/i 2 -> or both m -> 0, ^ 2 -> in (A.14)-(A.17) we have that (A.14)-(A.17) 
hold whenever /Ui > 0, /i 2 > 0; /xi < 0, ^ 2 < 0; /ii > 0, /x 2 < 0; f/,1 < 0, /i 2 > 0, 
respectively. 

Consider parameter points of the form /x = (/xi,0)', /Ui > 0. In this case 
(A. 14) and (A.16) hold. Adding these two inequalities yields 

(A.18) E^*i(z)>E^t B (z). 
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Let W(z) = ipl(z) — tpf D (z) and let <p(u) be the standard normal density. 
Then (A. 18) is 



0< / / W(z) ( f)(zi-fx 1 )(j)(z2)dz2dz 1 

(A.19) 

W(z)<j){z2)<t>(zi )e Z1 ^ e~ c ^ e c ^ e~^ 2 dz 2 dzi . 



Equivalently we have, for all [i\ > 0, 



0<y J W(z)0(z2)0(zi)e^ 1 - C2 ^ 1 dz 2 dzi 
(A.20) poo poo 

<!+/ / W(z)c\ ) {z x )<\ ) {z 2 )e^- C ^dz 2 dz x . 



c 2 

In the last integral of (A.20) when z\ > C 2 , ^ SD (z) = 1. Thus for any 
i>*{z) W(z) < for all z\ > C 2 - If W(z) < on a set of positive Lebesgue 
measure, then the last integral in (A.20) tends to —00 as \i\ — > 00. This 
would be a contradiction and so W(z) = a.s. for z\ > C 2 . Thus for z\ > C 2 , 
ipi(z) = tpf D (z). This type of argument, letting fi — > 00 so that (A.20) — > 00, 
is due to Stein. See, for example, Stein (1956). 

In a similar fashion we show that 

V4(z)=V| D (z) for z 2 >C 2 , 
^(z)=V! SD (z) far*i<Ci, 

/.SD 



^(z)=V 2 bU (z) forz 2 <C 2 . 

Therefore ip*(z) = tp(z) whenever (z\ < C\,z 2 < C 2 ), (z\ < C\,z 2 > C 2 ), 
(zi > C 2 ,z 2 < d) and (z x > C 2 ,z 2 > C 2 ). That is, V>*(z) = ip SD {z) un- 
less C\ < z\ < C 2 or C\ < z 2 < C 2 . Next return to (A. 14) and consider 
H = We have 

(A.21) < £ M M(z) + V 2 *(z) - ^P(z) - V|°(z)}. 

Let V(z) = (^(z) + ^(z) -^i D (z) -^| D (z)). In the manner that (A.20) 
followed from (A.19), we have that (A.21) yields 

/OO /"OO 
/ V(z)4>{ Zl )4>(z 2 - l) e ^- c t)^ dz 2 dzi 
-co J —00 

<2+ / V{z)4>{z l )<t>{z 2 -l)e^- c ^dz 2 dz l . 

When z\ > C 2 and z 2 G [Ci,C 2 ] we have V{z) < 0. As before, we have a 
contradiction in (A.22) as /Ui — > 00 unless ?/'*(z) = 1 in {z:z\> C 2 ,C\ < 
z 2 < C 2 }. 

Similarly it can be shown that ip*(z) = ip SD (z) for almost all z not lying 
in the box 

(A.23) {z:C 1 <z 1 <C 2 ,C 1 <z 2 <C 2 }. 
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The final step is to show that ip*(z) = ?/> SD (z) on (A.23). Now ip SD (z) = 1 
on (A.23), so (A. 15) would be violated when fj, = if ip* (z) 7^ 1 on a set of 
positive measure in (A.23). This completes the proof. □ 

Proof of Theorem 5.4. As in the case of the proof of Theorem 4.4 
we appeal to the Matthes and Truax (1967) theorem. We must show that 
the step-down procedure is monotone in y\ for fixed (1/2, . . . ,yk) if and only 
if p > 0. 

Now recognize that the step-down procedure is of the form reject H± if 
z\ > C(z( 2 )), z^ 2 ) = (z 2 , • • • , or in terms of the coordinates of y it is of 
the form reject if 

k / k 

(A.24) « V ,=3 k ^ 

p(yi + V2) + yz + pYI ■ ■ ■ ' p Yl Vi + Vk ) • 

i=i i=l / 

Note that the left-hand side of (A.24) is increasing in y\ for fixed y 2 , ■ ■ ■ ,yk- 
We claim the right-hand side of (A.24) is nonincreasing in y\ for fixed 
y 2 , ■ ■■ as long as p > 0. To see this, note that C(z^) is a nonincreas- 
ing function of its arguments. That is, as any i = 2, . . . , k, increases it 
becomes easier to reject H\; that is, the critical value in the step-down se- 
quence can only remain the same or become smaller. For example, if all Zi, 
i = 2, . . . , k, are less than Ck, then C(z^) = Cfe. If exactly one of z 2 , ■ ■ ■ , 
is bigger than then C(z^) = C'k-i- Thus the conditions of the Matthes 
and Truax theorem are met and the step-down procedure (and step-up pro- 
cedure) are admissible for VRI as long as p > 0. 

Next we show that if p < 0, then the step-down (step-up) procedures are 
not monotone on some sections (monotone in y\ for fixed y 2 ,---,yk) and 
therefore can be improved on these sections. Toward this end recall that 

(A.25) y = S~ 1 z and z = Ey. 

Let r = (1 p p ■ ■ ■ p)' be the first column of £ and define the points z* and 
z** as follows: 

z* = ((Cfc_i + Cfc)/2, Ck, • • • , Ck), 
z** = z* — er. 

The step-down procedure accepts H\ when z* is observed (it is a boundary 
point of the acceptance region). Since p < when < e is sufficiently small, 
the step-down procedure will reject z**. It follows from (A.25) that 

y * = Sz* = £(z** + er) = £z** + eSr 

= y** + M,...,0)'. 
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Thus the step-down is not monotone in y\. A similar argument works for 
step-up. 

□ 

Acknowledgment. We are grateful to the Associate Editor who suggested 
a better version of Theorem 4.1 and who supplied a more useful weight 
function to be used in the proof. 
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